Automatic text skew estimation in document images
Abstract
This paper describes an algorithm to estimate the text skew angle in a document image. The algorithm utilizes the recursive morphological transforms and yields accurate estimates of text skew angles on a large document image data set. The algorithm computes the optimal parameter settings on the fly without any human interaction. In this automatic mode, experimental results indicate that the algorithm generates estimated text skew angles within [illegible] of the true text skew angles with a probability of [illegible]. To process a 300 dpi document image, the algorithm takes 10 seconds on SUN SPARC 10 machines.
Similar resources
A Fast and Novel Skew Estimation Approach using Radon Transform
In this paper, an effective and reliable skew estimation technique for machine-printed documents and photos using the Radon transform is proposed and compared with other methods used for skew estimation, such as Fast Fourier Transform (FFT), Hough Transform (HT), combination of Horizontal Projection Profile (HPP) and Hough Transform, combination of Gabor filter and Radon transform, combination of...
Document Analysis And Classification Based On Passing Window
In this paper we present a document analysis and classification system to segment and classify the contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Skew Estimation by Parts
This paper proposes a new part-based approach for skew estimation of document images. The proposed method first estimates skew angles on rather small areas, which are the local parts of characters, and subsequently determines the global skew angle by aggregating those local estimations. A local skew estimation on a part of a skewed character is performed by finding an identical part from prepar...
An Integrated System for Handwritten Document Image Processing
In this paper we attempt to face common problems of handwritten documents such as nonparallel text lines in a page, hill and dale writing, slanted and connected characters. Towards this end an integrated system for document image preprocessing is presented. This system consists of the following modules: skew angle estimation and correction, line and word segmentation, slope and slant correction...